Learning Inter-Related Statistical Query Translation Models for English-Chinese Bi-Directional CLIR
نویسندگان
چکیده
To support more precise query translation for English-Chinese Bi-Directional Cross-Language Information Retrieval (CLIR), we have developed a novel framework by integrating a semantic network to characterize the correlations between multiple inter-related text terms of interest and learn their inter-related statistical query translation models. First, a semantic network is automatically generated from large-scale English-Chinese bilingual parallel corpora to characterize the correlations between a large number of text terms of interest. Second, the semantic network is exploited to learn the statistical query translation models for such text terms of interest. Finally, these inter-related query translation models are used to translate the queries more precisely and achieve more effective CLIR. Our experiments on a large number of official public data have obtained very positive results.
منابع مشابه
English-Chinese Bi-Directional OOV Translation based on Web Mining and Supervised Learning
In Cross-Language Information Retrieval (CLIR), Out-of-Vocabulary (OOV) detection and translation pair relevance evaluation still remain as key problems. In this paper, an English-Chinese Bi-Directional OOV translation model is presented, which utilizes Web mining as the corpus source to collect translation pairs and combines supervised learning to evaluate their association degree. The experim...
متن کاملEnglish-Chinese Cross-Language Retrieval based on a Translation Package
An inexpensive COTS translation package, augmented with a downloadable bilingual dictionary, was employed for a study of English-Chinese cross-language information retrieval (CLIR) using the query translation approach. The experimental setting involved the 170 MB Chinese collections and 54 queries of TREC and their relevance judgment, and our PIRCS bi-lingual retrieval system. With some standar...
متن کاملSimple Query Translation Methods for Korean-English and Korean-Chinese CLIR in NTCIR Experiments
in goal of our participation in the NTCIR op is to evaluate relatively simple yet al methods for CLIR using Korean queries lish and Chinese documents. We employed ary-based query translation methods for cases but with different translation ity resolution techniques. The KoreanCLIR was quite successful, but the -Chinese CLIR resulted in unexpectedly rformance. While our analysis is still in s, w...
متن کاملDCU's Experiments for the NTCIR-8 IR4QA Task
We describe DCU’s participation in the NTCIR-8 IR4QA task [16]. This task is a cross-language information retrieval (CLIR) task from English to Simplified Chinese which seeks to provide relevant documents for later cross language question answering (CLQA) tasks. For the IR4QA task, we submitted 5 official runs including two monolingual runs and three CLIR runs. For the monolingual retrieval we ...
متن کاملTREC-9 CLIR Experiments at MSRCN
In TREC-9, we participated in the English-Chinese Cross-Language Information Retrieval (CLIR) track. Our work involved two aspects: finding good methods for Chinese IR, and finding effective translation means between English and Chinese. On Chinese monolingual retrieval, we investigated the use of different entities as indexes, pseudorelevance feedback, and length normalization, and examined th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011